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AMENDMENT TO THE CLAIMS 

Please amend the claims as follows: 

1. (Canceled) 

2. (Currently Amended) A method for constructing a repeat database within a single species 
comprising: 

selecting a query sequence; 
selecting known repeat sequences; 

adding known repeat sequences into a repeat sequence database; 
masking said query sequence with repeat sequences in the repeat sequence database to 
create a contig assembly; 

testing said masked query sequence with a redundant file; 

identifying sequences in the redundant file that contain a similar sequence to a portion of 
the query sequence, wherein said identified sequences and said similar portion of the query 
sequence make up a pairwise sequence alignment; 

aligning all the identified pairwise sequence alignments; 

designating the right and left endpoints of each identified sequence and any intervening 
sequences; 

identifying a position within the query sequence corresponding to each endpoint; 
defining regions within the query sequence, wherein a region is a sequence between two 
consecutive positions matching two endpoints; 

identifying any two successive regions having a large variance in the number of sequence 
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matches; and 

adding the sequence within the region of the two successive regions having the highest 
number of sequence matches into the repeat sequence database. 

3. (Original) The method of claim 2, wherein the large variance in the number of sequence 
matches is equal to 5 or more. 

4. (Canceled) 

5. (Previously Amended) The method of claim 2, wherein said sequence is a 
deoxyribonucleotide sequence. 

6. (Previously Amended) The method of claim 2, wherein said sequence is a ribonucleotide 
sequence. 

7. (Previously Amended) The method of claim 2, wherein said sequences are derived from 
animal DNA or RNA. 

8. (Original) The method of claim 7, wherein said animal is a human. 

9. (Original) The method of claim 8, wherein said animal is a mouse. 

10. (Previously Amended) The method of claim 2, wherein said sequences are derived from 
plant DNA or RNA. 
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11. (Original) The method of claim 10, wherein said plant is a single-cell plant. 

12. (Previously Amended) The method of claim 2, wherein said sequences are derived from 
fungal DNA or RNA. 

13. (Previously Amended) The method of claim 2, wherein said sequences are derived from 
DNA or RNA of a microorganism or virus. 

14. (Previously Amended) The method of claim 2, wherein said sequences are derived from 
DNA or RNA of a single-cell eukaryote. 

15. (Previously Amended) The method of claim 2, wherein said sequences are derived from 
synthetic man-made DNA or RNA. 

16. (Previously Amended) The method of claim 2, wherein said sequences are postulated 
based upon amino acid sequences. 

17. (Original) The method of claim 2, wherein said database is encoded in a biological 
medium. 

18. (Original) The method of claim 2, wherein said database is encoded in a written medium. 

19. (Original) The method of claim 2, wherein said database is encoded in an electronic 
medium. 
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20. (Original) The method of claim 19, wherein said electronic medium is a computer- 
readable medium. 

2 1 . (Original) The method of claim 20, wherein said computer-readable medium is 
addressable through an internet connection. 

22. (Previously Amended) The method of claim 2, wherein said redundant file is a Public 
Domain Database. 

23. (Original) The method of claim 22, wherein said Public Domain Database is GenBank. 

24. (Original) The method of claim 22, wherein said Public Domain Database is dbEST. 

25. (Original) The method of claim 22, wherein said Public Domain Database is TIGR. 

26. (Original) The method of claim 22, wherein said Public Domain Database is SwissProt. 

27. (Previously Amended) The method of claim 2, wherein sequence comparisons are carried 
out using a Database Search Algorithm. 

28. (Original) The method of claim 27, wherein said Database Search Algorithm is BLAST. 

29. (Original) The method of claim 27, wherein said Database Search Algorithm is FASTA. 
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30. (Original) The method of claim 27, wherein said Database Search Algorithm is Smith- 
Waterman. 

3 1 . (Previously Amended) The method of claim 2, wherein said sequence comparisons are 
carried out utilizing a Scoring Matrix Program. 

32. (Original) The method of claim 3 1 , wherein said Scoring Matrix Program is PAM. 

33. (Original) The method of claim 31, wherein said Scoring Matrix Program is BLOSUM. 
34- 38. (Canceled) 

39. (Withdrawn) An improved database of nucleotide sequences, the improvement consisting 
of repeat sequences containing a similar sequence to a portion of a query sequence, wherein said 
identified sequences and said similar portion of the query sequence make up a pairwise sequence 
alignment, and wherein all identified pairwise sequence alignments have right and left endpoints 
of each identified sequence and any intervening sequences. 

The amendments entered into the case above are fiilly supported by the specification as follows: 

• p. 9, line 5-14 (pointing out the difficulty with public databases containing multiple 
sequence in the same species that the present invention overcomes by masking) 

• p. 13, line 19 (defining the contig) 

• p. 13-14, lines 21-23 and linesl-2, respectively (defining the contig assembly or map) 
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• p. 15, lines 4-23 (specifically stating the preferred within species repeat database 
construction) 

• p. 16-17, lines 18-23 and lines 1-6, respectively (creation of the contig assembly) 

STATUS OF THE CLAIMS 

1. (Canceled) 

2. (Currently Amended) A method for constructing a repeat database within a single species 
comprising: 

selecting a query sequence; 
selecting known repeat sequences; 

adding known repeat sequences into a repeat sequence database; 
masking said query sequence with repeat sequences in the repeat sequence database to 
create a contig assembly; 

testing said masked query sequence with a redundant file; 

identifying sequences in the redundant file that contain a similar sequence to a portion of 
the query sequence, wherein said identified sequences and said similar portion of the query 
sequence make up a pairwise sequence alignment; 

aligning all the identified pairwise sequence alignments; 

designating the right and left endpoints of each identified sequence and any intervening 
sequences; 
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identifying a position within the query sequence corresponding to each endpoint; 

defining regions within the query sequence, wherein a region is a sequence between two 
consecutive positions matching two endpoints; 

identifying any two successive regions having a large variance in the number of sequence 
matches; and 

adding the sequence within the region of the two successive regions having the highest 
number of sequence matches into the repeat sequence database. 

3. (Original) The method of claim 2, wherein the large variance in the number of sequence 
matches is equal to 5 or more. 

4. (Canceled) 

5. (Previously Amended) The method of claim 2, wherein said sequence is a 
deoxyribonucleotide sequence. 

6. (Previously Amended) The method of claim 2, wherein said sequence is a ribonucleotide 
sequence. 

7. (Previously Amended) The method of claim 2, wherein said sequences are derived from 
animal DNA or RNA. 

8. (Original) The method of claim 7, wherein said animal is a human. 
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9. (Original) The method of claim 8, wherein said animal is a mouse. 

10. (Previously Amended) The method of claim 2, wherein said sequences are derived from 
plant DNA or RNA. 

11. (Original) The method of claim 10, wherein said plant is a single-cell plant. 

12. (Previously Amended) The method of claim 2, wherein said sequences are derived from 
fungal DNA or RNA. 

13. (Previously Amended) The method of claim 2, wherein said sequences are derived from 
DNA or RNA of a microorganism or virus. 

14. (Previously Amended) The method of claim 2, wherein said sequences are derived from 
DNA or RNA of a single-cell eukaryote. 

15. (Previously Amended) The method of claim 2, wherein said sequences are derived from 
synthetic man-made DNA or RNA. 

16. (Previously Amended) The method of claim 2, wherein said sequences are postulated 
based upon amino acid sequences. 

17. (Original) The method of claim 2, wherein said database is encoded in a biological 
medium. 
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18. (Original) The method of claim 2, wherein said database is encoded in a written medium. 

19. (Original) The method of claim 2, wherein said database is encoded in an electronic 
medium. 

20. (Original) The method of claim 19, wherein said electronic medium is a computer- 
readable medium. 

21 . (Original) The method of claim 20, wherein said computer-readable medium is 
addressable through an internet connection. 

22. (Previously Amended) The method of claim 2, wherein said redundant file is a Public 
Domain Database. 

23. (Original) The method of claim 22, wherein said Public Domain Database is GenBank. 

24. (Original) The method of claim 22, wherein said Public Domain Database is dbEST. 

25. (Original) The method of claim 22, wherein said Public Domain Database is TIGR. 

26. (Original) The method of claim 22, wherein said Public Domain Database is SwissProt. 

27. (Previously Amended) The method of claim 2, wherein sequence comparisons are carried 
out using a Database Search Algorithm. 
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28. (Original) The method of claim 27, wherein said Database Search Algorithm is BLAST. 

29. (Original) The method of claim 27, wherein said Database Search Algorithm is FASTA. 

30. (Original) The method of claim 27, wherein said Database Search Algorithm is Smith- 
Waterman. 

3 1 . (Previously Amended) The method of claim 2, wherein said sequence comparisons are 
carried out utilizing a Scoring Matrix Program. 

32. (Original) The method of claim 31, wherein said Scoring Matrix Program is PAM. 

33. (Original) The method of claim 31, wherein said Scoring Matrix Program is BLOSUM. 
34-38. (Canceled) 

39. (Withdrawn) An improved database of nucleotide sequences, the improvement consisting 
of repeat sequences containing a similar sequence to a portion of a query sequence, wherein said 
identified sequences and said similar portion of the query sequence make up a pairwise sequence 
alignment, and wherein all identified pairwise sequence alignments have right and left endpoints 
of each identified sequence and any intervening sequences. 
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